

# UNITED STATES PATENT AND TRADEMARK OFFICE



UNITED STATES DEPARTMENT OF COMMERCE United States Patent and Trademark Office Address: COMMISSIONER FOR PATENTS P.O. Box 1450 Alexandria, Virginia 22313-1450 www.uspto.gov

| APPLICATION NO.                                                                                             | FILING DATE     | FIRST NAMED INVENTOR | ATTORNEY DOCKET NO.     | CONFIRMATION NO. |
|-------------------------------------------------------------------------------------------------------------|-----------------|----------------------|-------------------------|------------------|
| 09/785,143                                                                                                  | 02/16/2001      | Steven Orodon Hobbs  | 2007.013000/TDM         | 4992             |
| 22879                                                                                                       | 7590 01/26/2005 |                      | EXAMINER                |                  |
|                                                                                                             | PACKARD COMPAN  | FOWLKES, ANDRE R     |                         |                  |
| P O BOX 272400, 3404 E. HARMONY ROAD<br>INTELLECTUAL PROPERTY ADMINISTRATION<br>FORT COLLINS, CO 80527-2400 |                 |                      | ART UNIT                | PAPER NUMBER     |
|                                                                                                             |                 |                      | 2122                    |                  |
|                                                                                                             |                 |                      | DATE MAILED: 01/26/2005 |                  |

Please find below and/or attached an Office communication concerning this application or proceeding.

|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | Application No.                                                                                                                                                                                                   | Applicant(s)                                         |  |  |  |  |
|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------|--|--|--|--|
|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | 09/785,143                                                                                                                                                                                                        | HOBBS ET AL.                                         |  |  |  |  |
| Office Action Summary                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               | Examin r                                                                                                                                                                                                          | Art Unit                                             |  |  |  |  |
|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | Andre R. Fowlkes                                                                                                                                                                                                  | 2122                                                 |  |  |  |  |
| The MAILING DATE of this communication appears on the cover sheet with the c rrespondence address Period for Reply                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |                                                                                                                                                                                                                   |                                                      |  |  |  |  |
| A SHORTENED STATUTORY PERIOD FOR REPLY IS SET TO EXPIRE 3 MONTH(S) FROM THE MAILING DATE OF THIS COMMUNICATION.  - Extensions of time may be available under the provisions of 37 CFR 1.136(a). In no event, however, may a reply be timely filed after SIX (6) MONTHS from the mailing date of this communication.  - If the period for reply specified above is less than thirty (30) days, a reply within the statutory minimum of thirty (30) days will be considered timely.  - If NO period for reply is specified above, the maximum statutory period will apply and will expire SIX (6) MONTHS from the mailing date of this communication.  - Failure to reply within the set or extended period for reply will, by statute, cause the application to become ABANDONED (35 U.S.C. § 133).  Any reply received by the Office later than three months after the mailing date of this communication, even if timely filed, may reduce any earned patent term adjustment. See 37 CFR 1.704(b). |                                                                                                                                                                                                                   |                                                      |  |  |  |  |
| Status                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |                                                                                                                                                                                                                   |                                                      |  |  |  |  |
| 1)⊠ Responsive to communication(s) filed on 14 October 2004.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |                                                                                                                                                                                                                   |                                                      |  |  |  |  |
| <u>_</u>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |                                                                                                                                                                                                                   |                                                      |  |  |  |  |
| ,                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   | Since this application is in condition for allowance except for formal matters, prosecution as to the merits is closed in accordance with the practice under <i>Ex parte Quayle</i> , 1935 C.D. 11, 453 O.G. 213. |                                                      |  |  |  |  |
| Disposition of Claims                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |                                                                                                                                                                                                                   |                                                      |  |  |  |  |
| <ul> <li>4)  Claim(s) 1-40 is/are pending in the application.</li> <li>4a) Of the above claim(s) is/are withdrawn from consideration.</li> <li>5)  Claim(s) is/are allowed.</li> <li>6)  Claim(s) 1-40 is/are rejected.</li> <li>7)  Claim(s) is/are objected to.</li> <li>8)  Claim(s) are subject to restriction and/or election requirement.</li> </ul>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |                                                                                                                                                                                                                   |                                                      |  |  |  |  |
| Application Papers                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |                                                                                                                                                                                                                   |                                                      |  |  |  |  |
| 9) The specification is objected to by the Examine 10) The drawing(s) filed on is/are: a) acc Applicant may not request that any objection to the Replacement drawing sheet(s) including the correct 11) The oath or declaration is objected to by the Example.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | epted or b) objected to by the find drawing(s) be held in abeyance. See tion is required if the drawing(s) is obj                                                                                                 | e 37 CFR 1.85(a).<br>jected to. See 37 CFR 1.121(d). |  |  |  |  |
| Priority under 35 U.S.C. § 119                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |                                                                                                                                                                                                                   |                                                      |  |  |  |  |
| <ul> <li>12) Acknowledgment is made of a claim for foreign priority under 35 U.S.C. § 119(a)-(d) or (f).</li> <li>a) All b) Some * c) None of:</li> <li>1. Certified copies of the priority documents have been received.</li> <li>2. Certified copies of the priority documents have been received in Application No</li> <li>3. Copies of the certified copies of the priority documents have been received in this National Stage application from the International Bureau (PCT Rule 17.2(a)).</li> <li>* See the attached detailed Office action for a list of the certified copies not received.</li> </ul>                                                                                                                                                                                                                                                                                                                                                                                   |                                                                                                                                                                                                                   |                                                      |  |  |  |  |
| Attachment(s)  1) Notice of References Cited (PTO-892)  2) Notice of Draftsperson's Patent Drawing Review (PTO-948)  3) Information Disclosure Statement(s) (PTO-1449 or PTO/SB/08) Paper No(s)/Mail Date                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | 4) Interview Summary Paper No(s)/Mail Da 5) Notice of Informal P 6) Other:                                                                                                                                        |                                                      |  |  |  |  |

Art Unit: 2122

### **DETAILED ACTION**

1. This action is in response to the amendment filed on 10/14/04.

## Claim Rejections - 35 USC § 103

- 2. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all obviousness rejections set forth in this Office action:
  - (a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are such that the subject matter as a whole would have been obvious at the time the invention was made to a person having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the manner in which the invention was made.
- 3. Claims 1-4, 9-12, 17-19, 24-28 and 33-40 are rejected under 35 U.S.C. 103(a) as being unpatentable over Carr et al. (Carr), "Compiler Optimizations for Improving Data Locality", 1994, ACM, p. 252-262 in view of McGehearty et al., (McGehearty), U.S. Patent No. 6,029,225.

As per claim 1, Carr discloses a method, comprising:

- identifying a loop and each vector memory reference in the loop, in a program (p. 253 col. L lines 61-62, "data dependence (is determined) between two arrays (vector memory references) ... (in a loop)"),
- determining dependencies between vector memory references in the loop, including determining unidirectional and circular dependencies (p. 253 col. L lines 61-62, "data dependence (is determined) between two arrays (vector memory references) ... (in a loop)"),

Art Unit: 2122

- reducing cache thrashing (p. 252 col. R lines 8-9, "Improve the order of memory accesses to exploit all levels of the memory hierarchy", and exploiting the cache portion of the memory hierarchy, is to use it efficiently, in its designed manner. A cache, operating efficiently in its designed manner of operation, is kept full of the most used memory access locations while cache thrashing is minimized, and p. 252 col. R lines 32-34, "We use the model to derive a loop structure which results in the fewest accesses to main memory (i.e. making the code access the cache and main memory in an efficient manner, thereby reducing cache thrashing)"), by distributing the vector memory references into a plurality of detail loops, wherein the vector memory references that have circular dependencies there between are included in a common detail loop, and wherein the detail loops are ordered according to the unidirectional dependencies between the memory references (p. 253 col. L lines 2-6, "applying compiler transformations based on data dependence (e.g., loop interchange, fusion, distribution, and tiling) to improve paging... In this paper, we ... integrate optimizations for parallelism and memory", and p. 253 col. L lines 61-62, "data dependence (is determined) between two arrays (vector memory references) ... (in a loop)", and p. 256 col. L lines 48-51, "Loop distribution separates independent statements in a single loop into multiple loops with identical headers. To maintain the meaning of the original loop, statements in a recurrence (a cycle in the dependence graph) must be placed in the same (common detail) loop (and the detail loops must be ordered according to the unidirectional dependencies between the memory references)").

Art Unit: 2122

Carr doesn't explicitly disclose allocating the vector memory references into a plurality of temporary arrays, sized and located, so that none of the vector memory references are cache synonyms. Additionally, the limitation "so that none of the vector memory references are cache synonyms" is unclear. The examiner is interpreting this limitation as though any one temporary array cannot contain any two vector memory references that are cache synonyms.

However, McGehearty, in an analogous environment, discloses allocating the vector memory references into a plurality of temporary arrays, sized and located, so that none of the vector memory references are cache synonyms (col. 2:48-54, "The exact cache collision avoidance mode restructures the loop of moving data to form a series of loads to get several cache lines staged for loading, each element of data (i.e. vector memory references) is not only moved into the cache, but into registers (i.e. temporary array)... additional loads are interleaved with non-cache conflicting stores (i.e. references that are not cache synonyms) to move new values into memory").

Therefore, it would have been obvious to a person of ordinary skill in the art, at the time the invention was made, to incorporate the teachings of McGehearty into the system of Carr to have allocating the vector memory references into a plurality of temporary arrays, sized and located, so that none of the vector memory references are cache synonyms. The modification would have been obvious because one of ordinary skill in the art would have wanted to avoid cache collisions (McGehearty, col. 2:48-57).

Art Unit: 2122

As per claim 2, the rejection of claim 1 is incorporated and further, Carr discloses allocating a plurality of temporary storage areas within a cache and determining the size of each temporary storage area based on the size of the cache and the number of temporary storage areas (p. 252 col. R lines 9-12, "loop ... distribution ... requires knowledge ... of the cache line size", and p. 252 col. R lines 14-15, "Knowledge of the cache size, associativity, and replacement policy is essential", and the optimization technique of loop distribution includes the allocation of a plurality of temporary storage areas within a cache and determining the size of each temporary storage area based on the size of the cache and the number of temporary storage areas).

As per claim 3, the rejection of claim 1 is incorporated and further, Carr discloses a **section loop including the plurality of detail loops** (p. 256 col. L line 48, "Loop distribution").

As per claim 4, the rejection of claim 1 is incorporated and further, Carr discloses distributing the vector memory references into a plurality of detail loops further comprises distributing the vector memory references into a plurality of detail loops that each contain at least one vector memory reference that could benefit from cache management (p. 253 col. L lines 2-6, "applying compiler transformations

Art Unit: 2122

based on data dependence (e.g., loop interchange, fusion, distribution, and tiling) to improve paging").

As per claim 9, Carr discloses a method, comprising:

- identifying a loop and each vector memory reference in the loop, in a program (p. 253 col. L lines 61-62, "data dependence (is determined) between two arrays (vector memory references) ... (in a loop)"),
- determining dependencies between vector memory references in the loop, including determining unidirectional and circular dependencies (p. 253 col. L lines 61-62, "data dependence (is determined) between two arrays (vector memory references) ... (in a loop)"),
- reducing cache thrashing (p. 252 col. R lines 8-9, "Improve the order of memory accesses to exploit all levels of the memory hierarchy", and exploiting the cache portion of the memory hierarchy, is to use it efficiently, in its designed manner. A cache, operating efficiently in its designed manner of operation, is kept full of the most used memory access locations while cache thrashing is minimized, and p. 252 col. R lines 32-34, "We use the model to derive a loop structure which results in the fewest accesses to main memory (i.e. making the code access the cache and main memory in an efficient manner, thereby reducing cache thrashing)"), by distributing the vector memory references into a plurality of detail loops, wherein the vector memory references that have circular dependencies there between are included in a common detail loop, and wherein the detail loops are ordered according to the

Art Unit: 2122

unidirectional dependencies between the memory references (p. 253 col. L lines 2-6, "applying compiler transformations based on data dependence (e.g., loop interchange, fusion, distribution, and tiling) to improve paging... In this paper, we ... integrate optimizations for parallelism and memory", and p. 253 col. L lines 61-62, "data dependence (is determined) between two arrays (vector memory references) ... (in a loop)", and p. 256 col. L lines 48-51, "Loop distribution separates independent statements in a single loop into multiple loops with identical headers. To maintain the meaning of the original loop, statements in a recurrence (a cycle in the dependence graph) must be placed in the same (common detail) loop (and the detail loops must be ordered according to the unidirectional dependencies between the memory references)").

Carr doesn't explicitly disclose a plurality of detail loops that serially proceed through strips of vector memory references and store the strips in temporary arrays so that none of the vector memory references are cache synonyms.

However, McGehearty, in an analogous environment, discloses a plurality of detail loops that serially proceed through strips of vector memory references and store the strips in temporary arrays so that none of the vector memory references are cache synonyms (col. 2:48-54, "The exact cache collision avoidance mode restructures the loop of moving data to form a series of loads (i.e. a serial strip of vector memory reference) to get several cache lines staged for loading, each element of data is not only moved into the cache, but into registers (i.e. temporary array)... additional

loads are interleaved with non-cache conflicting stores (i.e. references that are not cache synonyms) to move new values into memory").

Therefore, it would have been obvious to a person of ordinary skill in the art, at the time the invention was made, to incorporate the teachings of McGehearty into the system of Carr to have a plurality of detail loops that serially proceed through strips of vector memory references and store the strips in temporary arrays so that none of the vector memory references are cache synonyms. The modification would have been obvious because one of ordinary skill in the art would have wanted to avoid cache collisions (McGehearty, col. 2:48-57).

As per claims 10-12 the Carr/ McGehearty combination also discloses such claimed limitations as addressed in claims 2-4 above, respectively.

As per claim 17 Carr discloses a method, comprising:

- identifying a loop in a program (p. 253 col. L lines 61-62, "data dependence (is determined) between two arrays (vector memory references) ... (in a loop)"),
- identifying each vector memory reference in the loop determining

  dependencies between vector memory references in the loop(p. 253 col. L lines 6162, "data dependence (is determined) between two arrays (vector memory references)
  ... (in a loop)"); and
- reducing cache thrashing (p. 252 col. R lines 8-9, "Improve the order of memory accesses to exploit all levels of the memory hierarchy", and exploiting the

Art Unit: 2122

cache portion of the memory hierarchy, is to use it efficiently, in its designed manner. A cache, operating efficiently in its designed manner of operation, is kept full of the most used memory access locations while cache thrashing is minimized, and p. 252 col. R lines 32-34. "We use the model to derive a loop structure which results in the fewest accesses to main memory (i.e. making the code access the cache and main memory in an efficient manner, thereby reducing cache thrashing)"), by distributing the vector memory references into a plurality of detail loops in response to cache behavior and the dependencies between the vector memory references in the loop (p. 256 col. L lines 48-51, "Loop distribution separates independent statements in a single loop into multiple loops with identical headers", and p. 252 col. R lines 14-15, "Knowledge of

Carr doesn't explicitly disclose storage of vector memory references in temporary arrays that are allocated consecutively so that no temporary array elements are cache synonyms.

the cache size, associativity, and replacement policy (i.e. cache behavior) is essential").

However, McGehearty, in an analogous environment, discloses storage of vector memory references in temporary arrays that are allocated consecutively so that no temporary array elements are cache synonyms (col. 2:48-54, "The exact cache collision avoidance mode restructures the loop of moving data to form a series of (consecutive) loads to get several cache lines staged for loading, each element of data is not only moved into the cache, but into registers (i.e. temporary arrays)... additional

Art Unit: 2122

loads are interleaved with non-cache conflicting stores (i.e. references that are not cache synonyms) to move new values into memory").

Therefore, it would have been obvious to a person of ordinary skill in the art, at the time the invention was made, to incorporate the teachings of McGehearty into the system of Carr to have storage of vector memory references in temporary arrays that are allocated consecutively so that no temporary array elements are cache synonyms. The modification would have been obvious because one of ordinary skill in the art would have wanted to avoid cache collisions (McGehearty, col. 2:48-57).

As per claim 18, the rejection of claim 17 is incorporated and further, Carr discloses determining dependencies between vector memory references in the loop, and wherein distributing the loop includes distributing the vector memory references into the plurality of detail loops, wherein the vector memory references that have circular dependencies there between are included in a common detail loop (p. 253 col. L lines 61-62, "data dependence (is determined) between two arrays (vector memory references) ... (in a loop)", and p. 256 col. L lines 48-51, "Loop distribution separates independent statements in a single loop into multiple loops with identical headers. To maintain the meaning of the original loop, statements in a recurrence (a cycle in the dependence graph) must be placed in the same loop").

As per claim 19, the rejection of claim 17 is incorporated and further, Carr discloses determining dependencies between vector memory references in the

Art Unit: 2122

loop, and wherein distributing the loop includes distributing the vector memory references into the plurality of detail loops, wherein the vector memory references that have circular dependencies there between are included in a common detail loop (p. 253 col. L line s 61-62, "data dependence (is determined) between two arrays (vector memory references) ... (in a loop)", and p. 256 col. L lines 48-51, "Loop distribution separates independent statements in a single loop into multiple loops with identical headers. To maintain the meaning of the original loop, statements in a recurrence (a cycle in the dependence graph) must be placed in the same loop").

As per claims 24-28 the Carr/ McGehearty combination also discloses such claimed limitations as addressed in claims 1-3 & 9-12 above.

As per claim 33 Carr discloses a method for reducing the likelihood of cache thrashing by software to be executed on a computer system having a cache (p. 252 col. R lines 8-9, "Improve the order of memory accesses to exploit all levels of the memory hierarchy", and exploiting the cache portion of the memory hierarchy, is to use it efficiently, in its designed manner. A cache, operating efficiently in its designed manner of operation, is kept full of the most used memory access locations while cache thrashing is minimized, and p. 252 col. R lines 32-34, "We use the model to derive a loop structure which results in the fewest accesses to main memory (i.e. making the code access the cache and main memory in an efficient manner, thereby reducing cache thrashing)"), comprising: executing the software on the computer system;

Art Unit: 2122

generating a profile indicating the manner in which the software uses the cache; identifying a portion of the software using the profile data that may experience cache thrashing; and modifying the identified portion of the software to reduce the likelihood of cache thrashing (p. 253 col. L lines 61-62, "data dependence (i.e. a portion of the software that may experience cache thrashing) (is identified) between two arrays ... (in a loop)", and p. 253 col. L lines 2-6, "compiler transformations (are applied) based on data dependence (e.g., loop interchange, fusion, distribution, and tiling) to improve paging (to reduce the likelihood of cache thrashing)... In this paper, we ... integrate optimizations for parallelism and memory").

Carr doesn't explicitly disclose distributing the cache synonyms into detail loops configured to allocate the cache synonyms into temporary storage arrays, sized and located to prevent cache thrashing.

However, McGehearty, in an analogous environment, discloses distributing the cache synonyms into detail loops configured to allocate the cache synonyms into temporary storage arrays, sized and located to prevent cache thrashing. (col. 2:48-54, "The exact cache collision avoidance mode restructures the loop of moving data to form a series of loads (i.e. cache synonyms) to get several cache lines staged for loading, each element of data is not only moved into the cache, but into registers (i.e. temporary array)... additional loads are interleaved with non-cache conflicting stores (i.e. references that are not cache synonyms) to move new values into memory").

Art Unit: 2122

Therefore, it would have been obvious to a person of ordinary skill in the art, at the time the invention was made, to incorporate the teachings of McGehearty into the system of Carr to distribute the cache synonyms into detail loops configured to allocate the cache synonyms into temporary storage arrays, sized and located to prevent cache thrashing. The modification would have been obvious because one of ordinary skill in the art would have wanted to avoid cache collisions (McGehearty, col. 2:48-57).

As per claim 34, the rejection of claim 33 is incorporated and further, Carr discloses modifying the identified portion of the software to reduce the likelihood of cache thrashing further comprises: identifying a loop in the identified portion of the software; identifying each vector memory reference in the identified loop; determining dependencies between the vector memory references in the identified loop of the software, including determining unidirectional and circular dependencies; and distributing the vector memory references into a plurality of detail loops, wherein the vector memory references that have circular dependencies there between are included in a common detail loop, and wherein the detail loops are ordered according to the unidirectional dependencies between the memory references (p. 253 col. L lines 2-6, "applying compiler transformations based on data dependence (e.g., loop interchange, fusion, distribution, and tiling) to improve paging", and p. 253 col. L lines 61-62, "data dependence (is determined) between two arrays (vector memory references) ... (in a loop)", and p. 256

Art Unit: 2122

col. L lines 48-51, "Loop distribution separates independent statements in a single loop into multiple loops with identical headers. To maintain the meaning of the original loop, statements in a recurrence (a cycle in the dependence graph) must be placed in the same (detail) loop (and the detail loops must be ordered according to the unidirectional dependencies between the memory references)").

As per claim 35, this is another method version of the claimed method discussed above, in claim 33, wherein all claimed limitations have also been addressed above.

As per claim 36, the rejection of claim 24 is incorporated and further Carr doesn't explicitly disclose that the temporary arrays are allocated consecutively such that no temporary array elements are cache synonyms.

However, McGehearty, in an analogous environment, discloses that the temporary arrays are allocated consecutively such that no temporary array elements are cache synonyms (col. 2:48-54, "The exact cache collision avoidance mode restructures the loop of moving data to form a series of loads to get several cache lines staged for loading, each element of data is not only moved into the cache, but into registers (i.e. temporary array)... additional loads are interleaved with non-cache conflicting stores (i.e. references that are not cache synonyms) to move new values into memory").

Therefore, it would have been obvious to a person of ordinary skill in the art, at the time the invention was made, to incorporate the teachings of McGehearty into the

system of Carr so that that the temporary arrays are allocated consecutively such that no temporary array elements are cache synonyms. The modification would have been obvious because one of ordinary skill in the art would have wanted to avoid cache collisions (McGehearty, col. 2:48-57).

As per claim 37, the rejection of claim 24 is incorporated and further Carr doesn't explicitly disclose that the detail loops are allocated into section loops that cause iterative execution of the detail loops based on a size of the strips.

However, McGehearty, in an analogous environment, discloses that the detail loops are allocated into section loops that cause iterative execution of the detail loops based on a size of the strips. (col. 2:48-54, "The exact cache collision avoidance mode restructures the loop of moving data to form a (ordered) series of loads to get several cache lines staged for loading, each element of data is not only moved into the cache, but into registers (i.e. temporary array)... additional loads are interleaved with non-cache conflicting stores (i.e. references that are not cache synonyms) to move new values into memory").

Therefore, it would have been obvious to a person of ordinary skill in the art, at the time the invention was made, to incorporate the teachings of McGehearty into the system of Carr so that that the detail loops are allocated into section loops that cause iterative execution of the detail loops based on a size of the strips. The modification would have been obvious because one of ordinary skill in the art would have wanted to avoid cache collisions (McGehearty, col. 2:48-57).

Art Unit: 2122

As per claims 38-40 the Carr/ McGehearty combination also discloses such claimed limitations as addressed in claims 1, 17 & 36 above.

6. Claims 5-8, 13-16, 20-23 and 29-32 are rejected under 35 U.S.C. 103(a) as being unpatentable over Carr et al. (Carr), "Compiler Optimizations for Improving Data Locality", 1994, ACM, p. 252-262, in view of McGehearty et al., (McGehearty), U.S. Patent No. 6,029,225, further in view of Mahadevan et al. (Mahadevan) U.S. Patent No. 5,797,013.

As per claim 5, the rejection of claim 1 is incorporated and further, the Carr/
McGehearty combination doesn't explicitly disclose inserting cache management
instructions into at least one of said detail loops to control movement of data
associated with the vector memory reference between a cache and main memory.

However, Mahadevan, in an analogous environment, discloses inserting cache management instructions into loops to control movement of data associated with the vector memory reference between a cache and main memory (col. 6 lines 54-55, "(the compiler can) insert prefetches and effect other optimizations (cache management instructions) into the ... loop code").

Therefore, it would have been obvious to a person of ordinary skill in the art, at the time the invention was made, to incorporate the teachings of Mahadevan into the Carr/ McGehearty combination to have cache management instructions inserted into detail loops to control movement of data associated with the vector memory reference

Art Unit: 2122

between a cache and main memory. The modification would have been obvious because one of ordinary skill in the art would want to compile the code using techniques that will maximize the efficiency of the compiled code's cache usage and therefore overall operation.

As per claim 6, the rejection of claim 1 is incorporated and further, the Carr/
McGehearty combination doesn't explicitly disclose inserting prefetch instructions
into at least one of said detail loops to control movement of data associated with
the vector memory reference between a cache and main memory.

However, Mahadevan, in an analogous environment, discloses inserting prefetch instructions into loops to control movement of data associated with the vector memory reference between a cache and main memory (col. 6 lines 54-55, "(the compiler can) insert prefetches and effect other optimizations into the ... loop code").

Therefore, it would have been obvious to a person of ordinary skill in the art, at the time the invention was made, to incorporate the teachings of Mahadevan into the Carr/ McGehearty combination to have prefetch instructions inserted into detail loops to control movement of data associated with the vector memory reference between a cache and main memory. The modification would have been obvious because one of ordinary skill in the art would want to compile the code using techniques that will maximize the efficiency of the compiled code's cache usage and therefore overall operation.

Art Unit: 2122

As per claim 7, the rejection of claim 1 is incorporated and further, the Carr/
McGehearty combination doesn't explicitly disclose performing loop unrolling on at
least one of said detail loops to control movement of data associated with the
vector memory reference between a cache and main memory.

However, Mahadevan, in an analogous environment, discloses **performing loop** unrolling on loops to control movement of data associated with the memory reference between a cache and main memory (col. 6 line 27, "the compiler unrolls loops").

Therefore, it would have been obvious to a person of ordinary skill in the art, at the time the invention was made, to incorporate the teachings of Mahadevan into the Carr/ McGehearty combination to have loop unrolling performed on at least one of said detail loops to control movement of data associated with the vector memory reference between a cache and main memory. The modification would have been obvious because one of ordinary skill in the art would want optimize the performance of the compiled code.

As per claim 8, the rejection of claim 1 is incorporated and further, the Carr/
McGehearty combination doesn't explicitly disclose inserting at least one of a
prefetch instruction and a cache management instruction into at least one of said
detail loops to control movement of data associated with the vector memory
reference between a cache and main memory, and performing loop unrolling on

Art Unit: 2122

at least one of said detail loops to control movement of data associated with the vector memory reference between a cache and main memory.

However, Mahadevan, in an analogous environment, discloses inserting a prefetch instruction and a cache management instruction into loops to control movement of data associated with the memory reference between a cache and main memory, and performing loop unrolling on loops to control movement of data associated with the memory reference between a cache and main memory (col. 6 lines 54-55, "(the compiler can) insert prefetches and effect other optimizations (cache management instructions) into the ... loop code", and col. 6 line 27, "the compiler unrolls loops").

Therefore, it would have been obvious to a person of ordinary skill in the art, at the time the invention was made, to incorporate the teachings of Mahadevan into the Carr/ McGehearty combination to have the insertion of at least one of a prefetch instruction and a cache management instruction into at least one of said detail loops to control movement of data associated with the vector memory reference between a cache and main memory, and performing loop unrolling on at least one of said detail loops to control movement of data associated with the vector memory reference between a cache and main memory. The modification would have been obvious because one of ordinary skill in the art would want to gain the performance advantages provided by using these optimization techniques in combination (Mahadevan, col. 6 line 22 – col. 7 line 29).

Art Unit: 2122

As per claims 13-16, 20-23 and 29-32, the Carr/McGehearty/Mahadevan combination also discloses such claimed limitations as addressed in claims 5-8 above.

## Response to Arguments

- 8. Applicant's arguments with respect to claims 1, 9, 17, 24-26, 33 and 35 have been considered but are moot in view of the new ground(s) of rejection.
- 9. Applicants arguments, with respect to the intended use of the application, have been fully considered but they are not persuasive.

In the remarks, the applicant has argued substantially that:

1) Carr teaches using loop distribution for a specified purpose that is different from the recitation of claims 1, 9, 17, 24-26, 33 and 35, at p. 11:24-12:20 & 12:21-16:3.

#### Examiner's response:

distribution for a specified purpose that is different from the recitation of claims, a recitation of the intended use of the claimed invention must result in a structural difference between the claimed invention and the prior art in order to patentably distinguish the claimed invention from the prior art. If the prior art structure is capable of performing the intended use, then it meets the claim. In a claim drawn to a process of making, the intended use must result in a manipulative difference as compared to the

Art Unit: 2122

prior art. See In re Casey, 152 USPQ 235 (CCPA 1967) and In re Otto, 136 USPQ 458,

Page 3

459 (CCPA 1963).

## Conclusion

10. Any inquiry concerning this communication or earlier communications from the examiner should be directed to Andre R. Fowlkes whose telephone number is (571) 272-3697. The examiner can normally be reached on Monday - Friday, 8:00am-4:30pm.

If attempts to reach the examiner by telephone are unsuccessful, the examiner's supervisor, Tuan Q. Dam can be reached on (571)272-3695. The fax phone number for the organization where this application or proceeding is assigned is 703-872-9306.

Information regarding the status of an application may be obtained from the Patent Application Information Retrieval (PAIR) system. Status information for published applications may be obtained from either Private PAIR or Public PAIR. Status information for unpublished applications is available through Private PAIR only. For more information about the PAIR system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free).

**ARF** 

TUAN DAM BUPERVISORY PATENT EXAMINER